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Generating and ranking of Dyck words 



Abstract. A new algorithms to generate all Dyck words is presented, 
which is used in ranking and unranking Dyck words. We emphasize the 
importance of using Dyck words in encoding objects related to Catalan 
numbers. As a consequence of formulas used in the ranking algorithm 
we can obtain a recursive formula for the nth Catalan number. 



1 Introduction 

Let B = {0, 1} be a binary alphabet and X1X2 . . . x n 6 B n . Let h : B — > {—1 , 1} be 



a valuation function with h(0) = 1 , h(1 ) = — 1 , and H(xiX2 . . . x n ) = 2_ M x 0- 



i=1 

A word X1X2 . . . X2n € B 2n is called a Dyck word [1] if it satisfy the following 
conditions: 



n is the semilength of the word. 
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n 




K(xiX2 . . . Xi) > 0, for 1 < i < 2n — 1 
h.(xiX2...x 2n ) = 0. 
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2 Lexicographic order 

The algorithm that generates all Dyck words in lexicographic order is obvious. 
Let us begin with in the first position, and add or 1 each time the Dyck- 
property remains valid. In the following algorithm 2u is the length of a Dyck 
word, no counts the 0s, and ni the Is. 
There are the following cases : 

Case 1: (no < n) and (ni < n) and (no > ni ) (We can continue by adding and 1.) 
Case 2: (nO < n) and (m < n) and (n = ni ) (We can continue by adding only.) 
Case 3: (no < u) and (ni = n) (We can continue by adding only.) 

Case 4: (no = n) and (ni < n) (We can continue by adding 1 only.) 

Case 5: (no = ni = n) (A Dyck word is obtained.) 

Let us use the following short notations: 



Dyck for 
Xi :=0 
no := no + 1 

LexDyckWords(X, i, no, ni ) 
n := n - 1 

The algorithm is the following: 



Dyck 1 for 
Xi := 1 
ni := ni + 1 

LexDyckWords(X, I, no, ni ) 
ni := ni — 1 



LexDyckWords(X, i, no, ni ) 

1 if Case 1 

2 then i := i + 1 

3 Dyck 

4 Dyck 1 

5 if Case 2 or Case3 

6 then i := i + 1 

7 Dyck 

8 if Case 4 

9 then i := i + 1 

10 Dyck 1 

11 if Case 5 

12 then Visit 

13 return 



The recursive call: 

X] = 0, no := 1 , ni := 
LexDyckWords(X, 1 , n , ni ) 
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For n = 4 the following result: 

00001111, 00010111, 00011011, 00011101, 00100111, 00101011, 00101101, 
00110011, 00110101, 01000111, 01001011, 01001101, 01010011, 01010101. 

This algorithm obviously generates all Dyck words. 

3 Generating the positions of Is 

Let bib 2 . . .b n be the positions of Is in the Dyck word xix 2 ■ ■ . x 2n . E.g. for 
Xl x 2 ...x 8 = 01010011 we have b,b 2 b 3 b4 = 2478. 

To be a Dyck word of semilength n, the positions bib 2 . . .b n of Is of the 
word Xix 2 . . .X2 n must satisfy the following conditions: 

2i < bt < n + i, for 1 < t < n. 

Following the idea of generating combinations by positions of 0s in the 
corresponding binary string |5] we propose a similar algorithm that generates 
the positions bib 2 . . . b n of Is. 

PosDyckWords (n) 



1 for i := 1 to n 

2 do bi := 2i 

3 repeat 

4 Visit b]b 2 ...b n 

5 ITMD := 

6 for i := u — 1 downto 1 

7 do if bi < n + i 

8 then bi := b t + 1 

9 for j := i + 1 to n — 1 

10 do bj := max(bj_i + 1,2j) 

11 IND := 1 

12 break (for) 

13 until IND = 

14 return 



For n = 4 the following result: 

2468, 2478, 2568, 2578, 2678, 3468, 3478, 3568, 3578, 3678, 4568, 4578, 4678, 
5678 
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The corresponding Dyck words are: 

01010101, 01010011, 01001101, 01001011, 01000111, 00110101, 00110011, 
00101101, 00101011, 00100111, 00011101, 00011011, 00010111, 00001111 

Because all values of positions that are possible are taken by the algorithm, it 
generates all Dyck words. Words are generated in reverse lexicographic order. 

4 Generating by changing 10 in 01 

The basic idea [2] is to change the first occurence of 10 in 01 to get a new 
Dyck word. We begin with 0101 ... 01 . 

Let us denote by X the Dyck word x-\xi ■ ■ ■ X2 n . 

DyckWords(X, k) 

1 i:=k 

2 while i < 2n 



3 do Let j be the position of the first occurence of 10 in XiXi+i . . .xi 

or if such a position doesn't exist. 

4 if j > 

5 then Let Y := X 

6 Change yi with ut + i . 

7 Visit yi-y 2 ...y2n 

8 Dyck Words ( Y, j - 1) 

9 i := j + 2 



10 return 

The first call is DyckWords(X, 1), if X = 0101 . . . 01 . 

For X = 01010101, the algoritm generates: 

01010101, 00110101, 00101101, 00011101, 00011011, 00010111, 00001111, 
00101011, 00100111, 00110011, 01001101, 01001011, 01000111, 01010011. 

Can this algorithm always generate all Dyck words? To prove this we show 
that any Dyck word can be tranformed to (01 ) n by several changing of 01 in 
10. Let us consider the leftmost subword of the form l ^ , for i > 0. Changing 
01 in 10 (i — 1 ) times, we will obtain a leftmost subword of the form i_1 1 . 
So, all subwords of this form can be avoided. 
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5 Ranking Dyck words 

Ranking Dyck words means [6] to determine the position of a Dyck word in a 
given ordered sequence of all Dyck words. 

Algorithm PosDyckWords generates all Dyck word in reverse lexico- 
graphic order. For ranking these words we will use the following function 
[7], where f(i,j) represents the number of paths between (0,0) and (i, j) not 
crossing the diagonal x = y of the grid. 

' 1 , for < i < n, j = 

*Ui\-l +f(bj - 1), for1<j<i<n 

1 MJ ~ j f(i,i-1), for1<i = j<u lj 

^ 0, for < i < j < u 

Some values of this function are given in the following table. 
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It is easy to prove that if C n is the nth Catalan number then 

n 

C n+1 =f(u+1,n) = £_f(n,i), n>0 (2) 

i=0 

k 

f(u+ 1,k) = Y f(n,i), n>0,n>k>0. 

i=0 

Using this function the following ranking algorithm results. 
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Ranking (bib2 . . .b n ) 



1 ci := 2 

2 for j := 2 to n 

3 do Cj := max(bj_i + 1,2j) 

4 nr := 1 

5 for 1 := 1 to n — 1 

6 do for j := C{ to bi — 1 

7 do nr := nr + f(n — i, n + 1 — j) 



8 return nr 

For example, if b =4 5 8 9 10, we get c =2 5 6 9 10, and nr = 1 + f(4,4) + 

f(4,3) + f(2,2)+f(2,1) = 1 +14 + 14 + 2 + 2 = 33. 

This algorithm can be used for ranking in lexicographic order too. 

6 Unranking Dyck words 

The unranking algorithm for a given n will map a number between 1 and C n 
to the corresponding Dyck word represented by positions of Is. Here the Dyck 
words are considered in reverse lexicographic order too. 

Unranking (nr) 

1 b :=0 

2 nr := nr — 1 

3 for i := 1 to n 



4 do bi := max (bi_i + 1 , 2i) 

5 j := n + i — bi 

6 while (nr > f(n — 1, j)) and (bi < n + i) 

7 do nr := nr — f(n — i, j) 

8 bi := bi + 1 

9 j:=j-1 



10 return b]b2 .. .b n 

If n = 6 and nr = 93, we will have: 92 - f (5, 5) - f (5, 4) - f (3, 3) - f (2, 2) - 
f (1 , 1 ) = 92 — 42 — 42 — 5 — 2 — 1 , so the corresponding Dyck word represented 
by positions of l's is: b = 4 5 7 9 11 12. Are changed from the initial values 
21 the following: position 1 by 2, position 3 by 1, position 4 by 1 and position 
5 by 1. 
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7 Applications of Dyck words 

If O is a set of C n objects, Dyck words can be used for encoding the objects of 
O. The importance of such an encoding currently is not suitably accentuated. 
We present here an encoding and decoding algorithms for binary trees, based 
on [I]. 

Algorithm for encoding a binary tree 

Let Bl be the left and Br the right subtree of the binary tree B. wOl means 
the concatenation of word w with 01, and w is considered a global variable. 

EncodingBT(B) 



1 if Bl 7^ and B R = 

2 then w := wOl 

3 EncodingBT(B l ) 

4 if B L = and B R / 

5 then w := wlO 

6 EncodingBT(Br) 

7 if B L / and B R / 

8 then w := wOO 

9 EncodingBT(B l ) 

8 w := wll 

9 EncodingBT (B r ) 
10 return 



Call: 
w := 

EncodingBT(B) 
w := wl 

For all trees of n = 4 vertices the result of the algorithm is given in Fig. 1. 
Algorithm to decode a Dyck word into a binary tree 

At the beginning the root of the generated binary tree is the current vertex. 
When an edge is drawn, its endvertex becomes the current vertex. 
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00101011 00101101 00110011 00110101 01001011 01001101 01010011 

01010101 00001111 00010111 00011011 00011101 00100111 01000111 



Figure 1: Encoding of binary trees for n = 4. 
DecodingBT(w) 

1 Let ab be the first two letters of w. 

2 Delete ab from w. 

3 if ab = 01 

4 then draw a left edge from the current vertex 

5 DecodingBT(w) 

6 if ab = 1 

7 then draw a right edge from the current vertex 

8 DecodingBT(w) 

9 if ab = 00 

10 then put in the stack the position of the current vertex 

11 draw a left edge from the current vertex 

12 DecodingBT(w) 

13 if ab = 1 1 

14 then get from the stack the position of the new current vertex 

15 draw a right edge from the current vertex 

16 DecodingBT(w) 

17 return 



Call: 

delete from the beginning and 1 from the end of the input word w 
draw a vertex (the root of the tree) as current vertex 
DecodingBT(w) 
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For some other objects related to Catalan numbers the corresponding coding 
can be found in [1] and at http:/ /www. ms.sapientia.ro/~kasa/CodingDyck.pdf, 



8 A consequence 

As a consequence of formulas ([I]) and §2^ the following formula for the (n+1 )th 
Catalan number results: 

C n+ i =1+^(-D k (!'"! C )c n _ k . (3) 

k>0 

We can prove that 

n /v _ "\ 

f(n,n-k) = Jj-1) l ( r'jCn-i 
i=0 \ 1 / 

for appropriate n and k, using mathematical induction on n and k, and formula 
([I]) in the form 

f(u,n-k) = f(n,n-k + 1 ) - f (n - 1 , u - k + 1 ). 

Now, from Q 

n n n— 1 

Cn+i = ^f(n,i) = f(n,0) + ^f(n,i) = 1 +^f(n,n-i) 

i=0 i=1 i=0 

n— 1 / n 



i=0 \k=0 



k 

n-l 



k=0 V i=0 



n — k 



k=0 v 7 

In the last line (J) + f^ 1 ) + • • • + ( n ^ k ) = (£; k ) has been used. 
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